This file generates plots of embeddings of pancreas and iPSC dataset with standard PCA+UMAP preprocessing pipeline from scanpy (which metadata covariates explain most of the variation found in these datasets? Does the embedding capture the pseudotime ordering estimated with RNA velocity analysis?)

Read in the count matrix into an AnnData object

Run PCA

Bonemarrow Dataset